Exploring spatio-temporal impact of COVID-19 on citywide taxi demand: A case study of New York City

Coronavirus disease 2019 (COVID-19) has brought dramatic changes in our daily life, especially in human mobility since 2020. As the major component of the integrated transport system in most cities, taxi trips represent a large portion of residents’ urban mobility. Thus, quantifying the impacts of COVID-19 on city-wide taxi demand can help to better understand the reshaped travel patterns, optimize public-transport operational strategies, and gather emergency experience under the pressure of this pandemic. To achieve the objectives, the Geographically and Temporally Weighted Regression (GTWR) model is used to analyze the impact mechanism of COVID-19 on taxi demand in this study. City-wide taxi trip data from August 1st, 2020 to July 31st, 2021 in New York City was collected as model’s dependent variables, and COVID-19 case rate, population density, road density, station density, points of interest (POI) were selected as the independent variables. By comparing GTWR model with traditional ordinary least square (OLS) model, temporally weighted regression model (TWR) and geographically weighted regression (GWR) model, a significantly better goodness of fit on spatial-temporal taxi data was observed for GTWR. Furthermore, temporal analysis, spatial analysis and the epidemic marginal effect were developed on the GTWR model results. The conclusions of this research are shown as follows: (1) The virus and health care become the major restraining and stimulative factors of taxi demand in post epidemic era. (2) The restraining level of COVID-19 on taxi demand is higher in cold weather. (3) The restraining level of COVID-19 on taxi demand is severely influenced by the curfew policy. (4) Although this virus decreases taxi demand in most of time and places, it can still increase taxi demand in some specific time and places. (5) Along with COVID-19, sports facilities and tourism become obstacles on increasing taxi demand in most of places and time in post epidemic era. The findings can provide useful insights for policymakers and stakeholders to improve the taxi operational efficiency during the remainder of the COVID-19 pandemic.


Introduction
Since 2020, the COVID-19 pandemic has spread globally with more than 279 million confirmed cases and 5.4 million deaths as of December 29, 2021.The majority of cases and deaths to date have been reported in the United States, India, Brazil and United Kingdom.The pandemic has undoubtedly brought enormous impacts and changes to economy, society, daily life and urban systems.As economic development progresses, urban transportation systems have been increasingly refined [1].Concurrently, the outbreak and spread of the pandemic have precipitated unparalleled alterations in these transportation systems, a critical component of urban infrastructure, during the epidemic period.Human mobility, travel demand and trip pattern are being reshaped as well.Taking New York City (NYC) for instance, the decline rate of peak transit ridership reached to 92% in the week of April 6 th , 2020 to April 12 th , 2020, compared to the same period in 2019 (Wang et al. [2]).Apart from the decrease of vehicles on roads, this epidemic also led to fewer public transport demand (Paul et al. [3]), changes in trip modes and travel destination decisions (Simons et al. [4]).Therefore, it is necessary to consider COVID-19 as one of the influencing factors when exploring urban mobility and optimizing public transportation.Meanwhile, human mobility can be modelled as a mixture of transportation modes, while taxi is always one of the most important travel modes in most major cities.As a result, this paper attempts to investigate the impact of COVID-19 epidemic on the travel demand variation in post pandemic era, through a case study focusing on taxi services.
We focused on how COVID-19 influenced human mobility in New York City using a set of open data collected from Aug 1 st , 2020, to July 31 st , 2021.NYC was selected for the case study for several reasons, the most obvious of which is that NYC was typical, affected seriously by the epidemic and also one of the metropolises around the world.Another reason to choose NYC is the availability of data.NYC has been continuously tracking its entire taxi fleet since early 2009, while other kinds of related data is sufficient and easy to access (economy, population, GIS, epidemic, etc.).
Furthermore, Geographically and Temporally Weighted Regression (GTWR) approach is adopted to explore the spatio-temporal impact of COVID-19 on taxi demand.A total of 15 typical influencing factors were selected as the independent variables, including population density, COVID-19 case rate, etc.Using the principle of GTWR, a time-space taxi demand model was built based on 12 valid ones among these 15 independent variables.From time dimension and space dimension, these 12 types of explanatory variables' effects on taxi demand were analyzed.Concentrating on the effect of this outbreak, COVID-19's marginal effects on NYC's taxis were also analyzed spatially and temporally.At the end of our research, reasonable suggestions were given for this post-pandemic era.
There are two significances derived from this study, which are taxi demand analysis in post pandemic era and time-space effect of coronavirus on taxi demand respectively.By modeling the relationship between taxi demand and its influencing factors under the pressure of this epidemic, it become more reasonable to dispatch the whole city taxis, reducing the needless waiting time and consequent cost; and by studying this outbreak's spatiotemporal effect on taxi demand, combined with some rationalization proposals, some laws and experience about urban taxi operation on dealing with COVID-19 can be obtained, which will be useful when facing serious health emergencies next time in the future.
The rest of this paper contains aspects as follows: related representative studies about this pandemic's effect on transportation, and principle and application of GTWR model are shown in Section 2. Data source, model variables visualization and selection are demonstrated in Section 3. Section 4 introduced GTWR model and the generation of the taxi demand model.Temporal analysis, spatial analysis and marginal effect of COVID-19 on taxi demand are illustrated in Section 5.According the model results and spatio-temporal analysis, some general laws and reasonable suggestions are obtained in Section 6 finally.

Literature review
Since the coronavirus outbreak, many researchers have correlated COVID-19 data with the traffic pattern variation and taken it as an influencing factor in urban transportation analyses.With the data from traffic counters, public transport ITS and traffic control cameras, Aloi et al. [5] revealed an urban mobility fall of 76%, and cities' public transport users dropped by up to 93%.Using official and secondary data in seven most populated cities in Colombia, Arellana et al. [6] found a distinct reduction of outdoor activities time and there was an about 30% increase in residential activities.In Chicago metropolitan area, Shamshiripour et al. [7] showed that 93% of respondents believed that transportation increased the risk of COVID-19 exposure.
Meanwhile, different models have been applied in analyzing the influence of COVID-19.Guo Y. et al. [8] proposed a Community Activity Score (CAS) and applied Negative Binomial models to model and forecast the spread of COVID-19 in Honolulu.Based on the Decision Tree approach, Pawar et al. [9] investigated the changes of traffic mode choices and found 5.3% of commuters shifted from public to private modes after the epidemic.With the Long Short-Term Memory Networks (LSTM), a type of recurrent neural networks (RNNs), Yao et al. [10] developed an accurate deep learning model and discovered that the daily confirmed COVID cases and daily fatality rate is highly related to the traffic volume in Detroit.Using a statistical method, i.e., bivariate analyses, Cusack [11] stated that nearly half of respondents changed their commute mode during the pandemic in Philadelphia.Chen et al. [12] quantified various factors (such as environmental, economic, social impacts, recycling modes) using an integrated framework supported by the analytical network process (ANP) and fuzzy comprehensive evaluation model.Zou et al. [13] introduced the CHMM method to analyze the interaction between driving behavior variables, which can better understand and explain driving behavior and original driving patterns, confirming the importance of considering the dependence between different variables.In order to overcome the uncertainty of the model, Wu et al. [14] applied the Bayesian model averaging (BMA) method to consider the advantages of different distributions in headway modeling.Zhang et al. [15] used the Hidden Markov Model with the Gaussian mixture model (GMM-HMM) approach to study the dynamic spatiotemporal characteristics and risk formation mechanism of vehicle lane changing.
Among various methods and diverse research themes in the analyses of COVID-19 influence on transportation, it is particularly meaningful to pay close attention to the COVID-19 data type, i.e., spatio-temporal data, because a confirmed COVID-19 case may influence transportation demand in its surrounding area and it could also affect the transportation pattern in several following days.Therefore, using temporal-spatial modeling methods is useful to analyze this effect.There are indeed some studies about the spatio-temporal analysis of COVID-19.Li S. et al. [16] analyzed spatiotemporal variation of air transportation influenced by the pandemic, and discovered the passenger throughput's changing rate is correlated to confirmed cases' growth.Saha et al. [17] explored the spatiotemporal variations in community mobility in India, and found the mobility towards residential area increased during the lockdown and decreased in unlocking period.Li A. et al. [18] focused on the virus' influence on micro-mobility, like bicycles, and found that activities towards home, park and grocery increased, while those towards leisure and shopping decreased, during lockdown period.
As for spatiotemporal taxi demand analysis, Liu Q. et al. [19] studied the relationship between urban environment and taxi demand, and pointed that taxi demand is high in densely developed areas and more bus stops would reduce taxi demand.Tang et al. [20] used multicommunity spatio-temporal graph convolutional network (MC_STGCN) to predict passenger demand in Shenzhen and New York City, and proved that MC_STGCN model had better performance than classical time-series model and deep learning model.Under the pressure of this pandemic, Yu et al. [21] employed multivariate linear mixed regression to disclose the relationship between taxi demand and COVID-19, and indicated that accumulated cured cases and blocking policy, have significantly influence on taxi demand in Ningbo, China.Zheng et al. [22] analyzed the variation of taxi market in 2019 lockdown and it was shown that even though taxi service reduced to some extent, it rebounded to exceed the pre-epidemic level after the lockdown.Considering that COVID-19 is kind of spatiotemporal data, using spatio temporal modeling methods may have better performance on illustrating the relationship between this pandemic and urban taxi demand and few studies used space-time modeling methods on this point.Therefore, a well-behaved temporal-spatial modeling method to fill in this gap is necessary, and we adopted GTWR model in this research.

Overview
From NYC Taxi and Limousine Commission (TLC), we got taxi trip data from Aug

Dependent variable
The number of pick-up taxies per taxi zone per hour was chosen as the dependent spatio-temporal variable to represent the taxi demand.In order to match weekly timescale COVID-19 data, we took the average of the hourly taxi demand over each week.There are 263 taxi zones in New York City and the sum of taxi demand per taxi zone in this one-year period is shown in Fig 1 .It is clear that in central urban area, Manhattan, and in two airports, LGA & JFK, taxi demand is high.In Fig 2, we also plot the sum of taxi demand per hour in all taxi zones, and find that taxi demand is high in the afternoon but low at midnight.

Independent variable
Considering common influencing factors of taxi demand, population density, ratio of commercial area, road density, transportation stations, 11 types of POIs and COVID case rate were included as our independent variable.

Population density.
Population density is a common variable when analyzing traffic demand.Since census tracts were different from taxi zones, we first divided the number of populations in each tract by its area and hence got corresponding population density.Next, we rasterized population density data in each census tract, and then each pixel got the value of corresponding population density.We then zonal summed up these pixels' values under the boundary of each taxi zone and got the overall value of each taxi zone.Finally, we divided each taxi zone's overall value by the number of pixels and got the population density in each taxi zone (see Fig 3).

Road density.
Road density can reflect the convenience of urban transportation to some extents and it also influences the demand of taxis.We calculate the road density per taxi zone as our explanatory variable whose unit is mile/mile 2 (see Fig 4).

Transportation stations.
Considering there are kinds of public transportation, like buses, subways, railways influencing taxi demand, we first calculated the number of transportation stations per taxi zones.To be specific, this variable includes bus station, bus stop, railway station and subway station.In order to eliminate the effect of area, we finally calculated the density of stations per taxi zone (see Fig 5), and the units are correspondingly number/mile 2 .

POIs.
Since point-of-interest (POI) data can be used to evaluate the land-use type, it is also used as an important factor in our analysis.We chose 11 types POIs from open street map, which is public, education, health, leisure, sport, catering, indoor accommodation, outdoor accommodation, shopping, money and tourism.The details are shown in Table 1.Like the station variable, we also calculated the density of POIs per taxi zone in order to eliminate the effect of area, and their units are number/mile 2 .these variables were larger than 0.7.Finally, catering, finance, shop variables were deleted.Table 2 is the correlation coefficient matrix of remaining variables.

GTWR model
By extending Geographically Weighted Regression (GWR) model with temporality, Huang et al. [23] developed GTWR model and applied it on real spatio-temporal data.Dong et al. [24] built the GTWR model to analyze determinants of haze pollution in China, and found that economic development and industry upgrading are main solutions to reduce haze pollution, while transportation industry and construction industry are main sources of haze pollution.Based on GTWR model, Liu J. et al. [25] found carbon emission intensity in China were influenced by urbanization, population, etc., and energy intensity had positive impact on carbon emission intensity.With the help of GTWR's principle, Guo B. et al. [26] explored the effects of socioeconomic and environment on chronic obstructive pulmonary disease mortality, and found the influence degree of anthropic factors are higher than natural factors.
With usually high goodness of fit and clear explanation of temporal-spatial characteristics, GTWR was also widely used in transportation analysis.Shen et al. [27] adopted GTWR modeling principle to analyze how land use and household properties influenced automobile travel demand, and their high accuracy study showed the influence of above two kinds of factors on travel demand varies temporally and spatially.In order to explore pedestrian injury severity geographically and temporally in Hong Kong, Xu et al. [28] used GTWR and found that it was significantly influenced by vehicles number, speed limit, injury location, etc. Ma et al. [29] utilized GTWR to model the relationship between bike-sharing usage and its determinants, and found that the elderly proportion and entertainment density have different correlation between dock less bike-sharing and docked bike-sharing.Based on GTWR, Ma et al. [30] revealed the relationship between built environment and transit ridership, and stated that temporal heterogeneity of coefficients was the key determinant of transit ridership per TAZ.
With the help of GTWR model, we built a spatio-temporal regression model and spatialtemporal distribution of coefficients generated by the model was used to analyze how different factors influence taxi demand in both time dimension and space dimension.
Generally, GTWR model can be formulated as follows: where y i ; x i1 ,x i2 ,. ..,x im are dependent variable y and independent variable x i1 ,x i2 ,. ..,x im at time is the intercept at time t i at point or grid (u i , v i ) and ε i is the error term at i whose mean is supposed to 0 and variance is supposed to σ 2 .
The estimation of GTWR model is shown as follows: where W(u i , v i , t i ) is space-time weighted matrix like: where d ST ij is spatial-temporal distance between position i and j and h ST is Space-time bandwidth parameter.We can calculate d ST ij by Eq (7) as below.
ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi where λ and μ are scale factors used to balance the effects of temporal and spatial distances.When λ = 0, GTWR model becomes temporal weighted regression (TWR) model, which only considers the time effect; And when μ = 0, GTWR model becomes geographically weighted regression (GWR) model, which only considers the spatio effect.

Data modeling
Based on the principle of GTWR model, our taxi demand analysis model was formulated as Eq (8).
where β 0 (u i , v i , t i ) is the intercept at time i at taxi zone i, β i (u i , v i , t i ) is the regression coefficient at time i at taxi zone i, and ε i is the error term.Table 3 describes one dependent variable (see freq), 12 independent variables, X, Y coordinates, time stamp, and their details.Besides, we calculated the X, Y coordinates of each taxi zone's centroid and recorded timestamps.
From the data description in Table 3, we found that there are too many "0" in data which may influence our model accuracy.In order to eliminate the effect of "0", we normalized our dependent variables using Eq (9).
where x 0 is new data, x is original data, μ is the mean of the sample data set, and σ is the std. of the sample data set.

Model estimation result
As shown in Table 4, we compared the model performance based on OLS, GWR, TWR and GTWR.Considering time effect and space effect, GWR, TWR and GTWR have better model performance than traditional OLS.In addition, while GWR only concentrates on space effect and TWR only concentrates on time effect, GTWR model combines the time-space effect as a whole, which performs better on spatio-temporal data theoretically than the other two models.The model estimation result proves this conjecture.Since GTWR's R 2 is the closest to 1 and its AIC is minimum, it can be known that GTWR model has the best goodness-of-fit among these four models.
The result of GTWR model is shown in Table 5.From the sign of average coefficients and median coefficients of 12 types of dependent variables, station, public, education, health, leisure, indoor accommodation, outdoor accommodation and population are positive, while sport, tourism, road density and COVID-19 case rate is negative, indicating that in post-epidemic era, stations, public facilities, education, health facilities, leisure facilities and accommodation can increase taxi demand, and other factors may reduce the demand.Among these influencing factors' average coefficients, the sign of COVID-19 cases rate's mean coefficient is negative, and the absolute value is the largest, which shows that COVID-19 has been a major factor in reducing the taxi demand in post pandemic era.Corresponding to COVID-19, health facilities, like hospital and clinic, shows a positive sign of average coefficient and its absolute value is relatively large, illustrating health facilities has become one of the major factors increasing taxi demand.The possible reason is that under the pandemic, people try not to go to places with high COVID cases, and their awareness of medical treatment increases.

Spatial feature of coefficients
With GTWR model and geographic information system (GIS), spatial distribution of some important coefficients can be visualized.By calculating the average coefficients of different dependent variables in this one-year time period, which are shown from Figs 8-15, it could be found that the promotion and inhibition degree of each explanatory variable varies from zone to zone.
Station density increases taxi demand in Lower Manhattan and northern Brooklyn, but decreases it near JFK Airport.Public facilities density increases taxi demand in most of New York City, especially in Bronx, but decreases it near the Southwest of Central Park.Education facilities density attracts taxis in most of Manhattan and Queens, but has a negative effect on taxi demand in most of Brooklyn and Bronx.Health facilities density promotes taxi demand in most of NYC, especially in Upper Manhattan and southern Bronx.Leisure facilities reduce taxi demand around Central Park and JFK Airport, but increase it in the Staten Island and southwest of Brooklyn.Sport facilities decrease taxi demand in most of NYC, except for the  As for COVID-19, it reduces taxi demand in most of NYC, except in southwest Brooklyn and some parts of Staten Island.In addition, the reducing degree in area around the Central Park is larger than other areas, and its possible reason is that in central NYC, the epidemic prevention policy is stricter than other area, leading to high level of inhibiting effect on taxi demand.

Temporal feature of coefficients
The results of GTWR can also provide us the temporal changes of the average coefficients.As shown in Fig 16, in terms of daily average regression coefficients, with the largest negative average coefficients, COVID-19 has become the major inhibiting factor of taxi demand in post era; in contrast to this, because of the largest positive coefficients, health has become the major promoting factor of taxi demand.Therefore, under the pandemic pressure, health care and epidemic have become the major influencing factor.
From 0:00 to 24:00, It is shown that COVID-19's coefficients climb up to the maximum from 0:00 to 6:00, then decrease to the minimum from 6:00 to 20:00, and increases from 20:00 to 24:00.Especially when it is 8pm, the average coefficient reaches the minimum, indicating that COVID-19 has the largest reducing degree to taxi demand; also, COVID coefficients at night is much lower than it in daytime, illustrating that the reducing degree at night is much larger than it in daytime.The possible reason of this phenomenon is Curfew policy beginning at 20:00.On the contrary, the coefficients value of health is always the largest, and its changing scope is fewer that COVID-19's.Especially in the afternoon, it reaches the maximum, causing the largest attractive level of taxi demand.
Except for COVID-19, tourism and sport always share negative coefficients, leading to the decrease in taxi demand.It seems to be inconsistent with common sense that some people may tend to take taxis after tiring tour or exercises.Under the pressure of this pandemic, we suppose the potential reason is that the virus makes the tourist area or sports center partially closed, and then these areas no longer attract taxis.Another negative factor is road density and it indeed decrease the taxi demand in most of daytime.Road density can reflect the convenience of local transportation and there will be more kinds and quantity of vehicles, like more buses, more subways and more private cars, in high road density area.These vehicles act as the competitors of taxis and hence road density become one of the inhibiting factors.
Among the promoting factors, most of their coefficients increase at 8:00 and decrease at night, showing a scenario that people begin their daily life at 8am and go home for rest at night.However, leisure factor is on the contrary of this trend: it has less attraction to taxi demand in daytime, but begin to increase its attracting level from 18:00.It can be used to explain this phenomenon that people need to work in daytime from 8:00 to 16:00, when leisure's coefficient is closer to 0, and they enjoy their night life after that.The further inference is that people will still go to leisure area, even though there are Curfew policy and covid virus.From week 0 to week 11, week 13 to week 41, week 45 and week 48, when it is Aug 1 st to Oct 24 th , Nov 11 th to May 22 nd , Jun 13 th to Jun 19 th and Jul 4 th to Jul 10 th , the sign of coefficients is negative, especially in week 4 and 10.But in week 12, week 42 to 44, week 46 to 47 and week 49 to 51, when it is Oct 25 th to Oct 31 st , May 23 rd to Jun 12 th , Jun 20 th to Jul 3 rd , Jul 11 th to Jul 31 st , the sign of coefficients is positive, indicating that covid-19 actually increase the taxi demand.Therefore, it is clear that in most time of autumn, winter, spring, COVID-19 depresses the taxi demand, but in some time of summer COVID-19 actually boosts taxi demand.The potential reason can be attributable to the characteristics of COVID-19 virus.Since people are more susceptible to infection in winter, they share higher awareness to the virus and avoid going to high case rate places in cold days.Moreover, they may also prefer other travel modes to public transportation in winter.Furthermore, some time-variant restraining order may also reduce the taxi demand in those high case rate places.
However, it is irreconcilable with our common sense, which is widely believed that COVID-19 should depress city-wide taxi demand in both summer and winter.In order to reveal the deep reason of this phenomenon, we extract the spatial distribution of average COVID-19's coefficients in week 47 (From Jun 20 th , 2021 to Jun 26 th , 2021) per taxi zones, Along with general spatial distribution, it will also be meaningful to find some differences between morning and evening rush hours, and there indeed is an obvious change in lower Manhattan.In morning rush hours, lower Manhattan is the only place in Manhattan that attracts taxi demand, while it decreases taxi demand in evening rush hour.

Conclusion
This research quantitatively analyzes the impacts of the pandemic COVID-19 on taxi demand temporally and spatially in New York City.After multicollinearity test, we selected 12 types of explanatory variables, including COVID-19 case rate data.Using GTWR modeling principle, a taxi demand analysis model with R 2 of 0.814 and AIC of 2035823 was built, which perform better than traditional OLS, TWR and GWR models.Then, we analyzed this pandemic's spatialtemporal effect on taxi demand from Aug 1 st , 2020 to Jul 31 st , 2021 in all NYC's taxi zones.
Major findings of this research can be concluded as: (1) GTWR has greater fitting performance on spatial-temporal data than traditional OLS, TWR and GWR models (2) COVID-19 and health care become the leading inhibiting and promoting factors of taxi demand in post epidemic era.(3) The inhibitory degree of this pandemic on taxi demand is larger in cold weather than it is in hot weather.(4) The inhibitory degree of this pandemic on taxi demand is severely influenced by the curfew policy and it reaches the maximum at the beginning of curfew at 20:00.(5) Although this virus dampens taxi demand in most of time and places, it still promotes taxi demand in some specific time and places.( 6) Sports and tourism also become obstructive factors on the increase of taxi demand in most of places and time in post epidemic era.With these conclusions, GTWR model is verified to be effective and workable on spacetime impact analysis of the epidemic; and it is suggested that taxis drivers avoid going places with high virus case rate in winter; and more taxis should be dispatched around health center, while fewer taxis should be around tourism and sports area in this post epidemic era.Additionally, the reinforcement of public health infrastructure is crucial, including enhancing the emergency response capabilities of medical systems and improving disease monitoring and prevention.Embracing remote working and digital transformation is essential for adapting to the post-epidemic work and lifestyle changes.This transformation encompasses fostering the technical capabilities of both public and private sectors.Furthermore, the promotion of digital payment methods to minimize physical contact is recommended, alongside advancing the digitization of taxi dispatch and tracking systems to elevate service efficiency and enhance passenger experience.Finally, collaboration with technology providers, government agencies, and other transportation services is essential to explore innovative service models and business strategies, addressing the evolving market demands of the post-epidemic era.Moreover, under the pressure of this pandemic, it can be optimistic that COVID-19 doesn't reduce all taxi demand, but actually boosts it in some specific area and time.
Further research can focus on following fields: (1) Comparing GTWR model with other spatio-temporal models, like Bayesian hierarchy model and Recurrent Neural Network (RNN), and analyzing the advantages of GTWR model.( 2) Using GTWR model to analyze this pandemic's effect on taxi demand in other cities and making contrastive analysis between this effect in NYC and other cities. (3) more transportation entities, like buses and subways, can be included to extend this research's application scenario.(4) The article uses New York City as an example to conduct GTWR analysis.The applicability to other cities needs to be analyzed.The universality of the method can be proved through comparisons in various cities.

Fig 5 .
Fig 5. Transportation station density in NYC.https://doi.org/10.1371/journal.pone.0299093.g005 distribution is shown and Staten Island has a high case rate compared with all NYC's boroughs.In Fig7, we counted the case rate per week in all taxi zones and find case rate is high from week 18 (Nov.29 th , 2020 to Dec. 5 th , 2020) to week 36 (Apr.4 th , 2021 to Apr. 10 th , 2020), which means case rate is high in winter in New York City.3.4Multicollinearity test.Multicollinearity can influence the model accuracy.Here we calculated Pearson correlations among nearly all variables, except time-space COVID-19 variables.The result showed that catering was highly related to station, health, indoor accommodation, shop and finance; finance was highly related to indoor accommodation and shop; and shop was highly related to health.It was because the Pearson correlation coefficients between

5. 4
Marginal spatial-temporal effect of COVID-19 on taxi demand 5.4.1 Seasonal effect.From Fig 7, we know that COVID-19 case rate is high in winter.In order to explore the seasonal effect of COVID-19 on taxi demand, we scale down average taxi demand, average covid case and average covid coefficients per week, and plot them in Fig 17.

Fig 21 .
Fig 21.Average coefficients of COVID-19 in evening rush hour.https://doi.org/10.1371/journal.pone.0299093.g021 1 st , 2020, to July 31 st , 2021.Combined with taxi zones data on NYC open data website, spatio-temporal taxi trip data was acquired.Also from NYC open data, we obtained NYC population distribution, zoning data and road network.Additionally, we collected COVID-19 case rate by MOD-ZCTA data on NYC health department, where MODZCTA is modified ZIP Code Tabulation Area geographies.Finally, bus stations, trains stations and Point of Interest (POI) data were achieved on Open Street Map.